Figure 1 



Anopheles gambiae arrestin 1 cDNA sequence (SEQ ID NO: 1) 



5 

ACAGGAACGACGGTTGTGATCCCTCCACTGGTGGTGACACGAATCATAAGCATTATTTCATACCT 
AAAAAACAAAATCTACAAAAAAAAGCTTCATTCCCATCGAAAAAACTTTCTTGTGAAATCAACCG 
AGCTAACAAACAACATCCTGTGCAAAATCTAGC AGTGAAAGTGTGATATCGTATAC CTGTAC CTG 
TAAACCGTTGTGCGCGTGTGTGCCTTTGTGTATCAATTTTGTGGAAAACAGAAAATACATCAAAA 
10 TGGTTTACAATTTCAAAGTCTTCAAGAAGTGCGCCCCTAATGGAAAGGTTACGCTGTACATGGG 
CAAGCGTGACTTTGTAGACCACGTTTCCGGCGTTGAACCGATCGATGGTATCGTCGTCCTCGAT 
GATGAGTACATTCGTGACAACCGTAAGGTATTCGGTCAGATTGTCTGCAGTTTCCGCTACGGCC 
GCGAAGAGGACGAGGTGATGGGACTAAACTTCCAGAAGGAGTTATGCCTCGCTTCCGAACAGAT 
CTACCCGCGTCCGGAAAAGTCGGACAAGGAGCAGACCAAGCTCCAGGAGCGACTGCTGAAGAA 
15 GCTGGGTTCGAACGCCATCCCGTTCACGTTCAACATCTCGCCGAATGCTCCGTCTTCGGTCACG 
CTGCAGCAGGGCGAAGATGATAATGGAGACCCGTGCGGTGTGTCGTACTACGTGAAGATCTTTG 
y- CCGGTGAGTCGGAAACCGATCGTACGCACCGTCGCAGCACCGTTACGCTCGGCATACGCAAGAT 
Q CCAGTTCGCACCGACCAAGCAGGGCCAGCAGCCGTGCACGCTGGTGCGCAAGGACTTTATGCTA 

n agcccgggagagctggagctcgaggtcacactagacaagcagctgtacctgcacggggagcga 

im ataggcgtcaacatctgcatccgcaacaactcgaacaaaatggtcaagaagattaaggccatgg 

Sp! tccagcagggtgtggatgtggtgctgttccagaatggtagctaccgcaacacagtggcatcgct 

j» ; ggagactagcgagggttgcccaattcagcccggctccagtctgcagaaggtaatgtacctcacg 

?; ccgctgctgtcctcgaacaagcagcgacgtggcatcgccctggacggtcagatcaagcgtcagg 

?5j atcagtgtttggcctcgacaaccctcttggctcaaccggatcagcgagatgctttcggcgttat 

*25 catatcgtatgccgtaaaggttaagcttttcctcggcgcactcggcggcgagctgtcggcggaa 

Ik CTTCCATTTGTGCTGATGCACCCAAAGCCCGGCACCAAGGCTAAGGTCATCCATGCCGACAGCC 
r : r AGGCCGACGTAGAAACTTTCCGACAGGATACAATCGACCAGCAGGCATCAGTTGACTTTGAATA 
= : GACGACGCAACGGTTTGGAAATGCTACCTACTACCCCAGGCATGGGCTAACACGACGAACGAAC 
* i! TACTACTACTAAGCATAAAAAACAGGAAAAAAAATGGAAAACTTAAAAAATGGATCATACAACCG 
I§0 AACGCAAACGACCTACGACGATCGATCTCACTTCCCCGTCTTTTTCATCCTAAGCAATAGAACGA 
!f j TGGTAGAAAAGGAAGATAAAGATGGAGAGAAAGTCACGTGTATCAATGACGACGACTACCAAAA 
111 CTGAAGACGTAACACATGTTCCCCAGCGAGCGGTAACTGTTCTGTTCTGACACCTTCCGCTCGA 
CAATGTACCTTTTAAAAACATACAAATTAGAAGTCGTCTTCACTACCTTCAACCAATCCAGCCAC 
TTTGGTATATACTTTTCATAGAATCCTTCTGAGCGCAAGGACCCTATTGAAATTCAGTGTTATTTT 
35 GTAACTGCGACCAAATGCCTAGCTGAATGTTGTTGAACGAGTTATGTACATCAAAAGATTGAATA 
AAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 2 

Anopheles gambiae arrestin 1 amino acid sequence (SEQ ID NO: 2) 



MWNFKWKKCAPNGKVTLYMGKRDFVDHVSGVEPIDGIVVLDDEYIRDNRKVFGQIVCSFRYGR 

EEDEVMGLNFQKELCLASEQIYPRPEKSDKEQTKLQERLLKKLGSNAIPFTFNISPNAPSSVTLQQG 

EDDNGDPCGVSYYVKIFAGESETDRTHRRSTVTLGIRKIQFAPTKQGQQPCTLVRKDFMLSPGELE 

LEVTIJDKQLYLHGERIGVMCIRNNSNKMVKKIKAMVQQGVDVVIJQNGSYRNTVASLETSEGCPI 

QPGSSLQKVMYLTPLLSSNKQRRGIAIJ)GQIKRQDQCIASTTLIAQPDQRDAFGVIISYAVKVKLFL 

GALGGELSAELPFVLMHPKPGTKAKVIHADSQADVETFRQDTIDQQASVDFE 



Figure 3a 



Anopheles gambiae odorant receptor 1 genomic sequence (SEQ ID NO: 9) 



Features: 

1) Presumed Untranslated 5 1 and 3' regions are underlined . 

2) Potential TATA box transcription initiation signal is double underlined . 

3) Putative Start (ATG) and Stop (TAA) codons are in BOLD. 

4) Intron s are tentativ ely assigned and are shown in lower case. 
Exons are 



AGCTTTGTTCATTTATGTTGAAATCTAGCCCATTTTGTATAGTGCTGAACGACGAAGAACATACGAAAGTACCTCGT 
CCGAACACTATCAACATTAATTATACCAAGCTAGAAGAAGATATTTATAGTCAAGCGTCAACATCATAGGAAACTTT 
AGCAAAAC CATTTAATTTACATGATGATAAGTC C CAC CTCTTAC CCC AGCACAGGTTTGAGAAGGACGAAAGTATCT 
TTACGATAATATTACTCTAAGGTAGTTTTTGAATAAAATAAAAATTTACGTGCAAGTGGTGGCATCGGACATCATTC 
GAAAGAATCTACTAAGTC ATACACACAC C CAAGACGACCGACGTAGTTTCATCTAGAAAAAACGGGTCAGCTC CATC 
GAACACGTCAGGACATAACTGCGACATGCGTATGGTCAGTTCCACTAGTGCCAACACTGGTTCCAGGGCACTACCTT 
CCGAAGCAGTAGAACCTAATGTATTGGAAATTATTAGGACATACTGCAACATGCATATGGCTAGTTCCGCTGGTACC 
AACGATGGCACCAGGACACTATCTGCGGCCTTGTAAAATCACTGTAAAATCTATACAAAAACGGCTTTACCCATACT 
TTATCACAAAAACGGCAGGTGAGGGCTGGATTGCTTCAAAGCATTAGAAATATATAATTTCAAAGTCCATAATCTCC 
TTAAAAGATAGACAaCAGTAGAGAACACATTTAGTGCTCTTTTCGTTCGAGTTAGTTGCCTTCTCAAGTAAGCGTTT 
AATGCTCAATTGTTGTAGATTCGTTGGATGACTCTCGCTACGTGCTATAGTGGTCAATACTTCCAATTAGATTTCAT 
AATTAGTTTCCAATTGTC C ACGGAAAACC C aCAAAAGAAAAAA AAACTTGTATCTAGGG 

GGATCCTTTg ttggatttca 



ATTGGACACTTCAT ATGAAAAAGGACAGCTTTTTCAAAATGTT^ 
attctccaaattctgcagaataattctgcaaattttacaaaactgctcaaccaccaataattccaattaatcatctg 
aacatttaaaactgataattaagatgagtaattgcttcgtcatcacctaagaaatcgattagtttggataaaaagaa 
caaattgaaatacaataaagtccctgaattttattcgaataacggcttgaactcatttatttcaaaaacctttgaga 
aattcctcgttgaaaattggtctcctatagttctgctaacgggccacttcaaaagcaagaactaacaaaatcataat 
tatggtgcaagtaactatcagtaccagtaatcgccattaaaaacttttcctcaatttgcggctcgttaccggctaaa 
tacagagcagagtaacgggaagtgatcaacgtcgctattagtataacgaggaacgccctccgaaggtgtgttgaagg 
accttttcaaattgaaaccaagtactgtttccagttttaaattggatagttataaaatgagccgttcaacgatcggg 
catcatttgagtttcatcttcgaggagaaatagatcagtgccactgtttaaccgaaagtaatgaagctgaacaaact 
gaacccacggtg gga tgcgtacgatcgacgggattcgttctggttgcagttgctttgtttgaaatatttagGCCTAT 

kTCGG^ 

aCCCTATACTTCAAGgATGTGAAGGATATTAATg t gag t c t c t agttagctattag 



tgttccacctgtccataatctgtcttttattgggt agGACATCGC AAATGC ATTGTTCGTGCTT ATGACTG AAGTGA 

^~C TAC AAC ATCGC AC 




tTAATATO^TA ^ 

jGlGCCCGC^ 

TAATCTGATAGGATATCTTGTTTATCCAATAGAGGTGTGGAAGCGTTCCCAAGCCATTCGTTTGATAGTTTATAGCA 
CCGTCGAGCAGTTGATCGCTGTGATCGCTAGGCGCACCTGATTTTATCTTTATCTCGCACCTGTTATGGCAAGGGCG 
CTTTTCACACGTTTCACACAATATAATGCACATGTATAATGCATTCTTACTTTAGCATTTTTGTTACATATAATACC 
AAAATTATGCATTTTTATTCTCACGCAACGATTAGAGGATGACTTcACAAAGGTCCATCTAGTGGTAGGAGGTATAC 
AATTATACCTCTCAAAATCTCACAGCAtAATGAGAAACAAAAGGATACCAAGCATACCCTTTTTTTACTTGACAATT 
TCATTTGATTTATGTAATAAAGCACTGCaCGTCGACTTCCTAAAA 



Figure 3a continued 



Figure 3b 

Anopheles gambiae odorant receptor 1 amino acid sequence (SEQ ID NO: 4) 



MKKDSFFKMLNKHRWILCLWPPEDTDQATRNRYIAYGWALRIMFLHLYALTQALYFKDVKDIND 

IANALFVLMTQVTLIYKLEKFNYNIARIQACLRKLNCTLYHPKQREEFSPVLQSMSGVFWLMIFLM 

FVAIFTIIMWVMSPAFDNERRLPVPAWFPVDYHHSDIVYGVLFLYQTIGIVMSATYNFSTDTMFSG 

LMLHINGQIWLGSMVKKLGHDWPERQLVATDAEWKEMRKRIDHHSKVYGTMYAKVTECVLF 

HKDILRIYLRASMRVCNYHLYDTAATTGGDVTMADLLGCGVYLLVKTSQVFIFCYVGNEISYTDKF 

TEFVGFSNYFKFDKRTSQAMIFFLQMTLKDVHIKVGSVLKVTLNLHTFLQIMKLSYSYLAVLQSM 

ESEZ 



Figure 4a 



Anopheles gambiae odorant receptor 2 genomic sequence (SEQ ID NO: 10) 



Features: 

1) Presumed Untranslated 5' and 3' regions are underlined , 

2) Potential TATA box transcription initiation signal is double underlined . 

3) Putative Start (ATG) and Stop (TAA) codons are in BOLD. 

4) Introns are tentatively assigned and are shown in lower case. 

5) Exons are 



GGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTCCCTCACCGTGACGTGCTAGAAATGGTTCAACATACTCGT 
CCGGCAGAGCGAAGACGACGAACAGCGGAATGTCCCAGGAAATGTAATGAGATATCACAGCAAGTGAACCCAAACCG 
AGCTGTGCGCTTTGTGTTGCGCTTTAAAAATGGCCCTTCCTTCGCCGCATCTGCTTGGTTTCACACGCTTTCCCAGG 
AAATCCACTGACCACTGGCCACACATCAACGACCGGAGCGGGAGCCTCAGTGCCCAGCGAAGCMATAATTTGCTCA 
AAAAGTCACGGTACTCAATTAATTTGATTATAATCAATTTCGTGGCTTCCAACACACCCTTCTTCCACAATCCATCG 
CCGAGTGAGCGAGTATAAAGGTGAAGAAACGTACCTT GCGCTTGCTCACTAACTGAACC^^TTTCAAAAAGGAACA 

Sfti^lSffi^TGGTGTCAATGTGC 

[CATCCCGGTCGCCGTGC| 



TAAACCGCAACCCACAGCCGAAA& 







CGTATCTGCGGCGGCCGC 
CCTGAAGCTGTACTCGTCCT' 
ACCTCGTCGtacgtgggcgaggggaggggcaataaccttcccacttggtggatattttcataccttttccatgtgtt 

Igtaagtcattggtttttctagtttttgggggagttgtttaca 
ccataaccaccc ccgacg gtaacatttgatcgtcccgcgaaaatgtttgtacagAAAAATGACGACATC§igl 
GCTGGAGCGGTACACAGGGCGGGGACGCATGCTATCGATATCGAATCTGTGGCTCGGCGCCTTCATTAG 
TTGTGACCTATCGTCTGTTTGTGCCCGGGCGCGGCCTACCGTACGGCGTCACGATACC 

:TTACCTTGeCCGCCTGC r 
5GCCCTJ 



:gcgacctgcacgctc 

GCCACAGCGGCACGATGGCTTCC 
CACAAACAAATCATCCAGtaagtagacgctagtagactcgaccggattgcccttccctcggggaggggaggtttgct 
atttcgggatgcggcagcacgcatacacacaaaccggaagccattaattctcccgttttcatgcccgcacgggcact 
gggtcatgttt caca tccttccttcctttccaaacacacacacgcgcgcgtgcacgtacac 




~\CCAGGTCGTi 






5taagtaaaatcgaccgacgtgcggtcgctagtccgtctccggactc teat ttcgggactcaatcgt tec 
atctctcaat ag AGC AATC AGCTGGCACAGATGATAATGATTGGATCGTACATCTTCATGAT ACTCTCGC AGATGTT 
TGCCTTCTATTGGCATGCGAACGAGGTACTGGAGCAGGtaatggcgctgaagctgagtttggttgagcggttcgcta 
tagatcggctgtcttacattgttgtgtt^^ 




Itttggctgatcgatgctctgttcaatgaacatggcacagaaggctgtgta 
aatagctgttcattaataagttttttcagaatgtatcgtttttagttgatttaaacgcattgttctatgcaatggta 
gcaacaatagaccgcctttattaatccaagcttcctttaggattgatttttattttaagagaaagataaaccatttt 
tagtaaccaatttagttacaggaaccaaaatacagaatttattattattattattattattattattattattatta 
ttattattattattattattattattattattattattattattattataattattattattattattattattatt 
attattattattattaatattattattattattattattattactattattattataattattacttttattattat 
tattattattattattattattattattattattattattattattattattataattatgattattattattatta 
ttattattattattattattataacaataataattattattattatttattattaattaattaatttattattatta 
attattattattgttattcattattatacattattatcataataataattttattatgattattattattattatta 
ttattattattattattattattattattcttattattattattattattattattattaatattatttttaatatt 
attattattattattactattcttattataattatttttttttattattattattattattattattattattatta 
ttattattattattgctattgttattattattcttattattgctattgttattattattattcttattattgttgtt 
gttgttgttcttattattgttgttgttgttattcttattattgtttattattattgtttttttttattctctaatta 



ttccagtaatccataataaaaaataataaagtaaataaatagtaaatagtaaataattccagtaactgtagtaatac 
acaataatctctaagaattaaaattgcattttgtaatgaaatatgttgattgttcgaatagttcagaaaaacttaaa 
aatgcctcagcattaaacagttttgaggttgttcagggcatttagtttagatattttagtattttaaagcatttgtt 
ttcattactacaaaaaagcaaatttatgagtgaattactttcagttcttctaaacgcctatgtgtatgcaattacat 
5 aacaatagctctcttttttattgcatttttccttagtaatctaaatccaatctcttctttccctcttgca^VITAAA 
GTCGGCAACGTGTACCCGATGACGTTGGAAATGTTTCAAAAATTGCTCAACGTGTCCTACTCCTATTTCACACTGCT 
GCGCCGAGTGTACAACTA AACTTAACCGGTAAACAAACAAAAATCCCCTCATCACTATGCAAAGACAGCAAGCAGCC 

AAGCTCTTTCACCCAAATTTTCATGGAGGTTCCCTCTCAACCAACCCATTGAAGCGAATAAAAGTATCAGCAACCAG 
10 GCGACGGTGAAAAAACGCTGCATTATTGTGCTTGCTTCAGCATTCCAGCGAATGACTCTTAAACTTTTCCATTCAAA 
AGTCGCGATGCTCACGATACGGAGCGGTGTGTTGTTCGATCCGCCGAGTGCACTCGCAAGCCGGTGATGTTGCCGGT 
GGAAATGCACAGATCGACACAGCGATAGATAATCGTTTGTTCGCGTAAATGGGAGGGAAAAAAGTAAGCTGCCAGCT 
ACTTCATTTCCATGTTAATTGAAACTCAAGCCAACGAACATGCAGAACCCGGTTGGTTGTGTGTCTCCGCTCCGGGA 
AAGGTCTCTGCTCCGGGGCATGGATTCTTTCCCCCTCCGGGTGGTTGGGGGTATTGTTTAGGTTTTTATTTTACAAA 
15 TTCATATCCTTCCGCTTCCGCATCAGCCGACCCGGTGGGTGCGCCAGACAGATGTGCGGCGGGCAACAAAACTATGC 
ACGAACATGGCCAACAAACACAGCTTCTATCTCATCTCTGTGTCGCACTGTCTCGCTTTCCCGCTGCGTTGCTTGTA 
GTACTATCATTGTTTTAGTCCACGGGTTTACTTCTAATTCCATTGCACCACGCAAAAAGGCTCATCGTTTGCTCGTT 
C^GGTTGCAACTTCGACAAGCGCATGGTTGGGATACGAACAAAAAACCAACTACTCCACCCACTACTACTACTACTG 
CCACCACCACTAACAACACTACACTTGGTTGGGAGCTTGCAGACCCACAAGCAAACAACGATACAAGCTAGCTAGCT 
| a 20 GCTGTGTGCGCTCGAGTCAGCCGACGGTACAAGGTTTAACCGGTACAAGCAACTCCCGGACCGATCCCAAAACTCTG 
*», ACAAGGCACGGGGCCGCATCCGGCAGTACGGTCGGAAAACATGGAAATGTTTAATTAAAACTGTAATTGTCAATCGC 
5j TGCTACAAGTTGTGACACAGGGAGAGAGAGAGACAGAGCGCGCCCGATGGTGATGGTGTAAAAGATAGATACAGGAA 
Jp| AAGAGCGAGAAACATTGGTACGATTTGGTGTGGTTAGCAAATTTGATTTCCACTGATTTTGAGTGCAAATTTAATGC 
ATCGAAAATTTGCCATTCAGGGTAAAGTTGCTCGTGGACGGATCCCCCGGGCTGCAGGAATTCGATATCAAGCTTAT 
y |5 CGATACCGTCGACCTCGAGGGGGGGCCCGGTACCCAGCTTTTGTTCCCTTTAGTGGA 

ii n 

J Figure 4a continued 
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Figure 4b 

Anopheles gambiae odorant receptor 2 amino acid sequence (SEQ ID NO: 6) 



MLIEECPIIGVNVRVWLFWSYLRRPRLSRFLVGCIPVAVLNVFQFLKLYSSWGDMSELIINGYFTV 
LYFNLVLRTSFLVINRRKFETFFEGVAAEYALLEKNDDIRPVLERYTRRGRMLSISNLWLGAFISA 
CFVTYPLFVPGRGLPYGVTIPGVDVLATPTYQWFVLQVYLTFPACCMYIPFTSFYATCTLFALVQI 
AALKQRLGRLGRHSGTMASTGHSAGTLFAELKECLKYHKQIIQYVHDLNSLVTHLCLLEFLSFGM 
10 MLCALLFLLSISNQLAQMIMIGSYIFMILSQMFAFYWHANEVLEASLGIGDAIYNGAWPDFEEPIR 
KRLILJIARAQPTDGGKIKVGNVYPMTLEMFQKLLNVSYSYFTLLRRVYN 



15 
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Figure 5a 

Anopheles gambiae odorant receptor 3 genomic sequence (SEQ ID NO: 11) 



Features: 

1) Presumed Untranslated 5' and 3' regions are underlined , 

2) Putative Start (ATG) and Stop (TAA) codons are in BOLD. 

3) Introns ar e tentatively assigned and are shown in lower case. 

4) Exons are 



AAGCAGAACACATCAAGAAGCAATTAGGTGTGTCGTACGTTAGCAAGTAGTTCGCGAGGAGGAATAAAATAGj 




tacagtttgaaaattctgaagaa tgcatcttacttgccttacttgttgttccagl 

TTGTTGGACGCACGCAGCACCGAGAGCGCCCCTGCACGCACTGACGTATTTTGGCTACTTTGACGTTTGCACCTTTG 
ACAGCTGAAGGACAGGGTACAATTTTTGCTGCTGTTATTACGCGCAGCGCATTGGATACGAAAACATTGGCCACAAG 
TTCTACGATTTTAGCGTTTATTTACTGTTCGTAGCAGCTTTTTTCCaCAATAAACAGACACAATAACGTACCGACAG 
TATTCTTTTCATTGTAGGATAGAGAAGCCGCCGGCCAGCAGCCAAAACGCGCCGCAAAACGAAAGGCGGCACCACCG 
GGGGAAAAACACGGGAGCAAAACGAGAACAGAACGCAGTAAACAACAAAACCGGCCGGAACAACAACGGTGCCGGAA 

ACGA 



Figure 5b 

Anopheles gambiae odorant receptor 3 amino acid sequence (SEQ ID NO: 8) 



MPSERLRLITSFGTPQDKRTMVLPKLKDETAVMPFLLQIQTIAGLWGDRSQRYRFYLIFSYFCAMV 
VLPKVLFGYPDLEVAWGTAELMFESNAFFGMLMFSFQRDNYERLVHQLQDLAALVLQDLPTEL 
GEYLISVNRRVDRFSKIYCCCHFSMATFFWFMPVWTTYSAYFAVRNSTEPVEHVLHLEEELYFLN 
IRTSMAHYTFWAIMWPTIYTLGFTGGTKLLTIFSNVKYCSAMLKLVALRIHCLARVAQDRAEKEL 
10 NEIISMHQRVLNCVFLLETTFRWVFFVQFIQCTMIWCSLILYIAVTGFSSTVANVCVQIILVTVETY 
GYGYFGTDLTTEVLWSYGVALAIYDSEWYKFSISMRRKLRLLLQRSQKPLGVTAGKFRFVNVAQF 
GKMLKMSYSFYWLKEQF 
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Figure 6a 

Anopheles gambiae odorant receptor 4 genomic sequence (SEQ ID NO: 12) 



5 

Features: 

1) Putative Start (ATG) and Stop (TAA) codons are in BOLD. 

2) Introns are tentatively assigned and are shown in lower case. 

10 GGGGAACTCCCCCACCCGACCAGACGACGGAAAGCTAACGATGTGCAATTGAATAGTCATTAGT 
AGCGTTTTTGCTCGCAAACGAACTAACCCTTTGACTTTTTAAGTTCACTACGGTGAGGACAAAAA 
TCAATAAATTAAATCGAGACCGTTGATGAGCAAAAGAAAAAAAAATATTTTACTGATTTTCATTT 
CGTTCCATCGACTACATAATCATAATTATATGCCACATTTTATTATAAGTTTTTGTATCATTTTTA 
AACAACACAAAAATGCATCCTTTCGAATATTAGTCAGGTTGTATCAACAATGAAGTTTGAACTGT 
15 TTCAAAAATATTCCTCCCCGGACACGGTCTTATCCTTCGTGCTAAGGCTTTTGCATATCGTGGGC 
ATGAATGGGGCAGGATTTCGGTCGCGAATTCGAGTTGGTGGCATTTTTCTGTTCTATTTAATCTT 
M, TCTTGTAATACCGCCACTAACGGGCGGGTACACCGATGGTCACCAGCGTGTACGCACCAGTGTG 
Ci GAATTCCTGTTTAATTGCAATATTTACGGCGGCAGTATGTTCTTTGCCTACGATGTGGCCACTTT 
;F: CCAAGCGTTCATCCAGGAACTGAAGAGCCTTTCGGTTTTGGgtaatatttaattaattaaaattgcgtttattgcat 
iffiO catcatttgtttctctttgcagTATGCTCACATTCGTACAGACTAAAGTATAAGCTGACCCGGTTCAACCGTC 
m GAGCGGATATTATCGCCAAAGTGCAAACGACCTGCATGGGTGCTGTAACGCTTTTCTACTGGAT 
T» TGCACCGATACCTTCCATCTGTGCGCACTACTACAGGTCGACCAATTCCACCGAACCCGTGCGG 
ft TTTGTGCAACATTTAGAGGTGAAGTTCTATTGGCTCGAGAATCGCACCTCAGTCGAGGACTACAT 
iri AACCTTCGTGCTGATCATGCTACCCGTCGTGGTTATGTGTGGTTACGTATGCAATTTGAAGGTGA 
"25 TGACCATCTGCTGCAGCATTGGACACTGTACACTGTACACCAGGATGACTATAGAGATGGTAGA 
J«, GCAGTTGGAAAGCATGGCATCAGCGGAACGAACTGCCAGCGCCATACGCAACGTGGGGCAGAT 
fT GCACAGTGGTTTACTGAAATGCATTAGGCTTTTGAACACGTCAATCCGATCGATGCTGATGCTGC 
IT! AGTGGTTGACCTGCGTGTTAAACTGGAGCATTTCTCTCATCTATCTAACGAACGTGgttagttttgtctt 
3 i! gtttggaaatccaaaaacaaaaagatggctataattgaactttctattacagGGCATCTCGCTACAATCGGTTACCGTGGT 
feO GGTAATGTTTTTTCTTGCCACTGCGGAAACTTTCCTGTATTGTTTACTTGGGACGCGGCTTGCGA 
CACAACAGCAGCTGCTGGAGCACGCACTCTATGCTACACGGTGGTACAACTACCCAATAGCCTT 
w TCGCAGCAGCATTAGGATGATGTTGAGACAGTCGCAAAGGCATGCACACATAACGGTGGGGAAG 
TTTTTTCGCGTTAATTTGGAAGAATTTAGCAGGATTGTCAACTTATCCTACTCTGCTTACGTCGT 
ACTTAAGGATGTAATAAAGATGGATGTACAGTGAATGTTTTTTTTTTTGGCTTGGCAACGAATGA 
35 AGTTTTCCGAATCTATATTAGATCTAGAATTTAATCTAGATGTCATAATATGATCTTGGCCATGA 
C CGGTTC CTGGTTTTGGAAC CAATTCTCAAAACAATTTTGAACTTAGGGCGAGGCATGAAATGTC 
CCAAGAACCTATCCAAGTTCTGGAACTACATATTACCGAATCTATCCCATTATTGCCTCGGAACT 
GGTTTGGTGCTAAATATTTGTCCAAATGTTGGTCCTGGACCTATCCAGACAAAGATCTTCAATTA 
TTCCTACCACTGGAACTGATTAATTGATGTAGGAAGTCATGGAGGTGTTCAGGGAGAATTTAAA 
40 CACTAATGTTCCAACTCATTATTTCAAGGGCAATTCTATTTTTTATATGCCCCTACGGATTGATAC 
GTATGTATTACTCCATTTCCTGGACTTTGTCTTATTCTTGCTGCTGATTGGACGTGAAATGTTGA 
GAAAAAGATTCTTATTTATGAGTGATACAGAGCCTTTAAATACTCCTACGTTGTTTGCTATTTAA 
GTATGGCCAGGCTAATCACAATCGCTACTAATGAACAGAATCTCTTCTAATTAAACCCTTTCGAT 
TGATAGTGTCAATGTCAATGTCGAGATAATTGAACTGCAAACgATACCTACCTTAAACGGAGCAG 
45 AACACATCAAGAAGCAATTAGGTGTGTCGTACGTTAGCAAGTAGTTCGCGAGGAGGAATAAAAT 

AG 
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Figure 6b 

Anopheles gambiae odorant receptor 4 amino acid sequence (SEQ ID NO: 14) 

5 

MKFELFQKYSSPDTVLSFVLRLLHIVGMNGAGFRSRIRVGGIFLFYLIFLVIPPLTGGYTDGHQRVR 
TSVEFLFNCNIYGGSMFFAYDVATFQAFIQELKSLSVLVCSHSYRLKYKLTRFNRRADIIAKVQTTC 
MGAWLFWIAPIPSICAHYYRSTNSTEPWFVQHLEVKFYWLENRTSVEDYITFVLIMLPVVVMC 
10 GYVCNLKVMTICCSIGHCTLYTRMTIEMVEQLESMASAERTASAIRNVGQMHSGLLKCIRLLNTSI 
RSMIJ^LQWLTCVLNWSISLIYLTNVGISLQSVTVVVMFFLATAETFLYCLLGTRLATQQQLLEHAL 
YATRWYNYPIAFRSSIRMMLRQSQRHAHITVGKFFRVNLEEFSRIVNLSYSAYWLKDVIKMDVQ 

NVSYSYFTLLRRVYN 
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Figure 7 
ANOPHELES GAMBIAE 



Preferred DNA Codons 



Amino Acids 


Preferred Codons 


Alanine 


Ala 


A 


GCC 


GCG 


GCT 


GCA 




Cysteine 


Cys 


C 


TGC 


TGT 








Aspartic acid 


Asp 


D 


GAC 


GAT 








Glutamic acid 


Glu 


E 


GAG 


GAA 








Phenylalanine 


Phe 


F 


TTC 


TTT 








Glycine 


Gly 


G 


GGC 


GGT 


GGA 


GGG 




Histidine 


His 


H 


CAC 


CAT 








Isoleucine 


lie 


I 


ATC 


ATT 


ATA 






T ,vGi n p 




K 


AAG 


AAA 








Leucine 


Leu 


L 


CTG 


CTC 


TTG 


CTT CTA 


TTA 


Methionine 


Met 


M 


ATG 










Asparagine 


Asn 


N 


AAC 


AAT 








Proline 


Pro 


P 


CCG 


CCC 


CCA 


CCT 




Glutamine 


Gin 


Q 


CAG 


CAA 








Arginine 


Arg 


R 


CGC 


CGG 


CGT 


CGA AGA 


AGG 


Serine 


Ser 


S 


TCG 


AGC 


TCC 


AGT TCT 


TCA 


Threonine 


Thr 


T 


ACG 


ACC 


ACT 


ACA 




Valine 


Val 


V 


GTG 


GTC 


GTT 


GTA 




Tryptophan 


Trp 


w 


TGG 










Tyrosine 


Tyr 


Y 


TAC 


TAT 









http://www.kazusa.or.jp/codon/cgi- 



5 bin/showcodon.cgi?species-Anopheles+gambiae+[gbinv] 
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Figure 8 



Name 


SEQ ID NO 


FIG. Reference 


Arrestin 1 (cDNA) 


SEQ ID NO: 1 


Figure 1 


Arrestin 1 (polypeptide) 


SEQ ID NO: 2 


Figure 2 


Odorant Receptor 1 (cDNA) 


SEQ ID NO: 3 




Odorant Receptor 1 (polypeptide) 


SEQ ID NO: 4 


Figure 3b 


Odorant Receptor 2 (cDNA) 


SEQ ID NO: 5 




Odorant Receptor 2 (polypeptide) 


SEQ ID NO: 6 


Figure 4b 


Odorant Receptor 3 (cDNA) 


SEQ ID NO: 7 




Odorant Receptor 3 (polypeptide) 


SEQ ID NO: 8 


Figure 5b 


Odorant Receptor 4 (cDNA) 


SEQ ID NO: 13 




Odorant Receptor 4 (polypeptide) 


SEQ ID NO: 14 


Figure 6b 


Odorant Receptor 5 (cDNA) 


SEQ ID NO: 15 




Odorant Receptor 5 (polypeptide) 


SEQ ID NO: 16 


Figure 9b 


Odorant Receptor 6 (cDNA) 


SEQ ID NO: 17 




Odorant Receptor 6 (polypeptide) 


SEQ ID NO: 18 


Figure 10b 


Odorant Receptor 7 (cDNA) 


SEQ ID NO: 19 




Odorant Receptor 7 (polypeptide) 


SEQ ID NO: 20 


Figure lib 
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Figure 9a 



Anopheles gambiae odorant receptor 5 genomic sequence (SEQ ID NO: 21) 

Predicted Exons: ITALICIZED, UNDERLINED AND 
Introns: lowercase. 



10 



15 



:: - ■ 



go 



1=25 



tctagacttgaacccatgacgggcattttattgagtcgttcgagttgacgactgtaccacgggaccacccgtttatcactatcactatt 

aattaattataatatgcttttgtagcgatcagcctaccgggttttgtttctctggatatcttaagttcccatttgattatcaagatagaa 

caacaacttgtaccttaaataatcattacgtacccttaatcaacctgtgcatcaaggagttttcgcgaaagcaaaaatccgattgtct 

gatgttgtcttgattccatccgattcgttactggttctgcaaaatcgtccaataatacggcaatgtccttatcgatgcttgaatcaacat 

cacattgtttgcatttcgttttttgcgtgcaaatatgttatttgcaaagaaggcaaggtaatgtgcttaagagtaaatacaattcgctg 

tccattttttgtccaccagtgtgccagaacccgtgcctt.ttagtccttcgaatacatccgaccagtcagcaagcaagtgcatcATGG 

aftrT4 cm a a ggtgtccga a ccgtacgccgtgatgccgc ttctactacgcctgcagci 



VG1TGG 



AGCTTCTG 
iTGGTTC< 




GGTGAACGAC 



TTATTC 




CTAC 




gcgg: 




CTTCGGCTA TCCA GA T1TA GAGA C 



agctgattttcgaatggaacgtactg 




"G TTTTCTCTCAA G CTA GA C( 



GTTCCC-m 



ataatgattgataaaaggaacctttgagcaactcctatccctttcaagi 

WJGACTATCTGGTACGCATCAATCATCGTATOGM 



CAAGA TCTAC 




CCAGCACCTACCTAGCGTACi 







'GGCCATCTTCTACTGGGTGG 



'AGGAGGAGCTGTACTGGTTTC. 





AGCGTAC 





35 



40 



AAAG( 




iCAACGTGAAGTACTGTTCGGCAATGCTa 



GCGATGAGAATCCAGTTCATGGACCGGCTGGACGAGCGCGAAGCGG 



30 ctagctgctttca 

TlVATACAGTGCGTAATGATCTi 




aaggtctgccggtatgttgtggat agaatacattt 
VATCT TT CGGTGGGTfTTT&FC 




TACGTCGCCGTTACG ztaacta 



aaagcactgtagtgatctgtctgccacaccattcactgctgtgtcttgttttgtcactcttccca 

iCAAACGTGGG 




JAGTGATCTTACCTi 



GTTATmGCTGA stttcasttacttttccettcccc 

***mmMmmmmmmmniii*tm ummmmmrmm B 5-<. 1 ,. -,- ,<.. ■, !tt? ... — ■ ... 




tctaaccgtaccacttgtaccatttgtttgag 
TATCGCCGTTCGGTTTCGATTCAACGGAA* 

AACCGGTCGG 

llgtatggggagaccttccactgtg gcaag aaagattt^ 

jaggggaactcccccacccgaccagacgacggaa 
agctaacgatgtgcaattgaatagtcattagtagcgtttttgctcgcaaacgaactaaccctttgactttttaagttcactacggtgag 
gacaaaaatcaataaattaaatcgagaccgttgatgagcaaaagaaaaaaaaatattttactgattttcatttcgttccatcgacta 
cataatcataattatatgccacattttattataagtttttg 



45 



15 



15 

:: . 



Figure 9b 

Anopheles gambiae odorant receptor 5 amino acid sequence (SEQ ID NO: 16) 



MVLPKLSEPYAVMPLLLRLQRFVGLWGERRYRYKFRLAFLSFCLLWIPKVAFGYPDLE 
TMVRGTAELIFEWNVLFGMLLFSLKLDDYDDLVYRYKDISKIAFPJtCDVPSQMGDYLVRI 
NHRIDRFSKIYCCSHLCLAIFYWVAPSSSTYLAYLGAPvNRSVPVEHVLHLEEELYWFHTR 
VSLVDYSIFTAMLPTIFMLAYFGGLKLLTIFSNVKYCSAMLPXVAMRIQFMDRLDEREA 
10 EKELIEUVMHQKALKCVELLEIIFRWVFLGQFIQCVMIWCSLVLYVAVTGLSTKAANVG 
VLFILLTVETYGFCYFGSDLTSEASCYSLTRAAYGSLWYRRSVSIQRKLRMVLQRAQBCP 
VGISAGKFCFVDIEQFGNMAKTSYSFYIVLKDQF 



H 5 i 
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Figure 10a 

Anopheles gambiae odorant receptor 6 partial genomic sequence (SEQ ID NO: 22) 

5 These are the predicted last three exons of another candidate Anopheles 
gambiae odorant receptor. 

Predicted Exons: ITALICIZED, UNDERLINED AND 
Introns: lowercase. 



10 



145 

G 



3 5 i 



Mi 

i 5 



aacacccatcttatcggcaaaattagtatttaccgtttgaaagcggcttcccttcctggctgtttctcactctctctctctctgtctctctta 
t.tgatgccgtatgcgccgcgtgctataggctag rrArGCri:4CCGGATGrrGCGA rCG CGCACGrGCrrr 

W&CGCATACGCCAGTGCACACTl'GATGGCGGTGGTGATGACGTC 



:TGCTCGTGAGTCAGACCTTTTCATTTCCTG( 



[cagacggttagacggatatatgctgg: 



mm 



;tgatgagcjgagtcgcgagtaattgaagcttttgcttttaaaacacatcaga 



irCG gtaagj 



;caccaggc 



ICGG' 



n GGTACCGG( 



ggtcggtggci 



UTCCGTGCTGCj 



mccA( 



ITGATACTGA CCGCAi 



;agttgccaat tgattgccg tttgcgttaat atttca gtaagagtgcgctctttcccttag 

~|gtaa 



gcgcgagagagagagagagagcagtatcgttcaccctttggatgaatcaatagatttctaatcatgaaccattgaaaaatgaatca 

acattttcgctagttgcacaatattgtaccattctatacagcttcaccacgaccaagcgtttgttgcatcaggaccaaacacgtttcga 

caagccgcgtcacctgctggc 
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Figure 10b 



Anopheles gambiae odorant receptor 6 partial amino acid sequence 
(SEQ ID NO: 18) 



LCLPDVAIAHVLFRIRQCTLDGGGDDVCCAPFSARESDLFISCNILFLSRPHRRLDGYML 

VKFVLFMLCFLffiLLMLCAYGEDIVESPWGDZCRLRLRMWGRVGGVPSIRAANYTPQP 

AVRHTDRMENLAHPNEYFQSDPASFLVLLYPPEDRLRE 

10 



r 



s-; 



TOSS' 
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Figure 11a 

Anopheles gambiae odorant receptor 7 genomic sequence (SEQ ID NO: 23) 



10 



Features _ _ 

1. Predicted Exons (7): ALL CAPS. ITALICIZED. Ul 

HIGHLIGHTED 

2. Introns (6): lowercase 

3. 5' and 3' sequences: lowercase,, dotted underlined 



15 



3 i! i 

5 Til' 



30 



35 



40 



45 



ccgccc.gggcaggtgacttacg^ 
tatcacgctgcgcttaccagcgcctg^ 
gcacsgttgcatcgctaaggtttcsgt^ 
gcattgtjgtttagtgagaa^ 

gatgccgaacattcgggttgatgcaggccagcggtcaactt^gtto 

ACGCGGACGACGTGAACGAGCTGACCGCCAACACGATCACGACCCTGTTC1TCACGCA 




GCCA TC TGGAACCAGA CCAA CAGMk 



Mm : 



'GC GGTCAACTCGGAGAACTTCTA 

CGAATCGGAi 



1ACGCTC 



^gtatgtgtgtatgtgtgtggccgtttgggaaagtgtctttgcggcagaaccccaatctactgttacgc 
ttgactgggtttttgtttttttctcggtggagggacgggataaaatatctgaaagaataattgagtcaacccacagggggatgcaag 
acatcgcaggcagagagtttgggtttgatttatcaccgcacaccgaatatcttcacggttcataagcttcaccgcggtgaaaaggga 

actccccatttasctrttttettttttttcttcctctcgataaattacte 

„._,-.,,..„,.-., — - _ _ . 

cctactag -CCTGC 




rGAATgtcctgcgcgtcacagttggcagtcagtgagcggcaacacggcgaaaaaatgggactaaaaccggtcttcacaga 
gccaacacattcctacagcaattgcataccttcgggcggtcgggactgggcaatgcagctacaacatcctcgcctaaagttatgcaat 
tcgagcgacaaatgttgccgtgttagggctttttgtgataatagtcgtttttttgtcctctcgcttatcaaactctatcaacggaggaaa 
tccattttcgctacaatgcctacagctcaagtttcaaggtcaatcgagcgggtggggatcaacttttttattcattttgctaacgcccca 
tcaacaaattctatgttctcaatggcaaagattactgcccgcaccaatcgcccaacgaaacggcaaaagaaaagcgacgattatga 
agatgtccaaaccattgcccgcccgacgctttatctgatgatttgcgggatggcttttacttgtctgctactttcaggcacaaaaggaa 
atgaaaccagcgcaggctcgtttgccggcttgcggaggttcttcaggcactgaggctgagtacttaaatcgaacgatttttacgattc 
tggatccagttttatgatgtggcctgcattacagtggcaattataccctgatgttcatttcattgcattttgtaagtttgtgctggtaacg 
cccgtaacgattaattcttttcaaagagattctttcaaagagattcaaaatgtgtataacaaatgctaacgaatggaccgtacttgg 
agggttgcggaaagtaacgttttaaaatattcatcacaatcctctgcaaacttgtgcttaattaattggtgcacaataagtttaaact 
gtggcggcagatgtgtcgctgtccgcttccttccttcccagcaagctcgtgcgaaataatttattccatcattttaatacagccgtttgtg 
cattttaattagcaaagcaatataaaaagcagctaaccatccccattaaaacaaagtgcttccgggcccaattgttatggcggtgga 
aagtaatggttttaccagtggaagtgtcctttcccatcgtgggtacttcgcgatattcttgtcttatacaagtgcatacagaaaaaaa 
ggacaaatcctccttgctatggtctaaggccagcttcggtaccgcttccgcttcgggatgtcataaagtttgatgggtgtttttaacatt 
acttccgctcttaaccacctaatggacttttcatgcttgagctaaagttaaaccagccaccagcggtacgcaccgagccacggttgatt 
tcggcggcggcctcatccccagttttgcgccaccaatattgccttcattaatctgtaccctcggagcgttagggcccgcggacgagtcct 
cgttgtaatgcaccgccatgccacgggacgggataatccgttgggacggcgcgaaagcgactatcgcggacggattggttcgaccg 
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tgctacaacacattttatgcttcacagatttacttcctgctgttttcgatggtccagagcaacctcgcggatgtcatgttctgctcctggt 

tgctgctagcctgcgagcagctgcaacacttgaaggtaggtacggtagcaaacgtggttgtctttacatccgcgtgcagcattatcct 

tatcgacgtgtagtgttaacggtaaaagaggaagcgataaaaaagcaaeatt^ 

ctctctctctctctctctct^ 

:CA ACTCTTCGCAACTGTTCCGAGCAA 



IGTTCCAAATCGGAGCTGATCATCjiACGAAGztatztzaR^ 



gactcaaagagagcataacacaatcccctggtagttcatttcaatgaccttaacactcggcaagctaagcgagacagtggggacag 
tgagaaagagagaacaagaaaaaaaaccatcatccgtacgacatcatcgctacgtaccggtatttcaggatgaggaaataaaac 
gctaggggaatgaaagtgcgacagaatgataaaacaatccccacccaggcccccagcctggacgaacggatgtagtgtgcgaagc 
gagcaaaaaaagtcaaataaattgaagtttaaaaatagattttccccgtccatccgtggtggagcgtaaagcccggcggacaactt 
cgagcacggcgaccgtgcacagtactgtgccacagttgtagggacggataagctccgttccttttttatcctttttttttggagatttgt 
ttgcgttcgcatcgttagacgagcttagtgccgtgttgctctaattgctatttattataaagcgcttccaaatagaagatcggttctctc 
catttaatctatcgcgcctgtacgcctgaaactatgcactgtgctgtgaaaccgtcaagctcgagcacgacgaatggcccaccgtacc 
acgcccgtggtgcccaaagcgcaacgcgaattgcatgttaacaaacctttgcctaccatccaatccgtgtgaaattgcccgctctcttt 
ctctcttttgcgctttcggtgtatcgaacggttttgtcccttttttttactttgctcttgatctcttgctgtgctcactttcatctc 
ctgacggtggtgggttttcgaaaaaagagcgatttcttctgcgtgtgtgtgtggtttttttaaataaccgctccaggtcgtgttgaacg 
ctgcaggaccgatcggagctagtttattatcagctttagtgtttatcccacccatgccccacatcacgtctgtggagagtgggggaag 
cttaagtccaatgtaatttaccgtgtttctgtcgttcgtcaccttcttcgtcgatggagattggtgcggttggcacgataaaagcccact 
gcacgttacggaccgagggaaaggtctttttgtaggcctagcaacggtcctcattcaccgcatgggggtgtagctcagatggtagag 
cgctcgcttagcatgtgagaggtaccgggatcgatacccggcatctccaacccacacaaaacgttttttaagaagatttttagggaa 
gatattaacgcgggtacactgtgctcctctaagttggaagagtagatgagatgatgacaagggagaaggaacatgtgtacgtgttt 
gatagcaaacacacaaacaacaatatcatctctgataataatctgatgtgtgatgtgtgtgtattgttgttatgctgcctttgccatct 
tgtccctctctctcctgttcaactcctaaaagaattgtttggagtcctctcagttcctcgtaaagatcctttcgagattcttctttccttttt 
attatttattccacgagcctctgacataagtagccttccgcttatttccttctccttgcacttgtcagttccgtgtagagcgtcattttgag 
gtttacacattt^ 




gtaggtatggtaatttctaaggtgtggtgtaaag 
cctccaggttccatgaaaaagggatactttaccacagtaagagtttgttttgctggacttacattctttggagcattgtttggtgttgtg 
ctgaaaccggttgcaatatcgttttgcgaagaaattatgtgtaaagcgtattacaatctcattcctctgttaatctgtaccaattgtgtc 
agccccgaccgaaagcaggcctaattcgtaccagaaaaaccacaagctgtttgtaagcatcgatacgcccgaagctttcaatccagc 
caaggcgccacctactattgacgtgactttttgcacgttcacactctccctctcccattctttctataaccaatcgtcgctcagccagcat 
cgcccggagtgaagtttttatttgaacgatatcacccgtatcgattttccactaaacatgcttaaatcgtttcacaaagctcccccaaa 
atcccatttcaccaatccaccaatttgaagtccgtcgtcctttgtgtccttgtgtttgtgtgtttgtgtgagctggagacatgggggagt 
gagtaaccgaacaacctcttgccgctgcttcacgatatcgaacagcaccaagataagcatccctttttccctagccgatgtctccgata 
tctcgattccgcttccagcgaggcaaagaaaaaggcgaactggctgacctcacccggggcgaggaaaaagcgtagggattacgtc 
gagcagcacgagttgtgatttcttcttcttctggttccataaatcgctgacggtttccattaccgcctgcggagtgcacacacgtgaag 
ggaaagcgaaaacgtttagattccagcagcaacggcagcaccagaagcagcagcagcgcggcaaattgaatcatcctgacgcgat 
gagttgtctgggttttcgggtcggtggcttacagcaccacaccatctgctgcagctaatacagctgtaaatttcgttagacatagactt 
gattttacaatattacacacacacttacacacacagctatagatttgtcgcttggcgtatggctctgtacggcgtgccgtacatgccgc 
gagccgtgttgctgctggttgcgatacggatcacgtccgattcgattcagcctgcgtgtttttggtgaagatccttatcggtgacccact 
ttcagtgtgtcgagagcgagggtcactatggcgcctgtcagttggaaagctaggctcgattcaaagggccattgtgccagtgttcttt 
ttaagatagcgataagcttttgatcgaaatagtaaatcaaacattgtttcttttttcctattccaaactgttgccaacctcattattacg 
tttttgcagcgggtgtatagtaaattgcatactttaaggcgtgattttcaaatgtagcgttccgtatgcagaaacgccatggattatgc 
aatttaaacaatgctgcttccttaacattcaaataacggcttattaaggaactttttgtgcaatttgtttttaacagcaaatagttagc 
tcagaacgatcacatttagtatcgcttcaacaaagaactcttttaaacacacaatttgtaatgccattccctcgagaaagtttcttgtc 
agtcctcctctgcatcacagcaacaaccaaacctgctcatgtttcctgctcgtttcctagctgttttgaacgttatttccgattcctgtgct 

20 



tgcccgcttttcttacaatcaaccacaatggttcagatttcgctcttattttattgacccactgctttcgtgctgaagcccgtggaaacaa 

tgcgccaagctcagcatccagccatgcatgtaaaatgagccacgcgacagattttagacatcgctttcgctctgcaccggaggtggtt 

ttattcttgtttccgattcccacgtccattcgtcctgggtccgtccgccgggcccgaaaccgtaagccgtgcggggaattacgcaatcga 

aacgagccagaaaatgagcacgccaaatgcaaagaaaatccccttttgagtggtgctcctgccaccactcatctccccaactggtgg 

gtgaaaaaccttgtgcgccccttctctttccagaaaaaaaacgcctcgctcgcacaaaaacatgctcgcccggtgaagctgcgtatgt 

cgcagaagctcaaaccaacgccgccagcaagcatcaacaatttctattcaaacacccaacgcagcgcccaaaccgggtgcactgta 

ctcagtagcgaagatgctcagattgtcccgtgcgctgctttcgatgcccgtttcggagcgggaagccatcgcttgccaacgttggcgat 

gtcttttagccgtggatttgaattttctgaatatcacaggcgggcgcggtttgcctgcaaggttgttgcttcccacacgagcattgcttt 

ccgtaccgcggtggggcgagttttcaacgcaaccttctacaagcaacgccacaacgcctgggagcgatatttaacagaaacaagaa 

catcccgaacttcagcacatgccgtgatttgcctgttggaaaagcttttgtgagcgtgtgagttgaacgagctctattttcccagcgat 

gggtggcatttgtgtggcatgctatcgtcagcttttcttgaatctttacctctccattcgcctccattagtacacgcgtatggaaaatgg 

gtgcaacggatcagaacggattttccgcgacagacttaataaagggaaagcaacgcgttttttgcatgtgtagtgtttatgagcttt 

atgccgttactttgcaattaaaaatagcaaaaaataacagtttttttttgtaagcggattacaaagaatgtatcagaatattacgtg 

aaacattcatttcatgctgttaacgctcaaatagaatagttttgtaacacggattgcataccttgccggtatcggttacattttcgccta 

acagtatgcaatctgtttagctttgttgtttaatgactgcgttggtagtacaatatttatttacaccgcgtaatttatctcacaaattgc 

aaaaaaatgtcaatctgtatcgattattcacacaaatcagatcccggaaccagtgtagcccaatgtgctcttattgaattaccacga 

acaaatcaacctgatgcccgggtccgttggcaaacagcttgcgccgaagccgctcagtgtttcgtgcactaccgtgctgccattttgct 

gccctcatcgaacagataaacagaagggcaactcttgtgagcatcgcaatgcccgtctgaagttccgtcgaaaatgggcctaaattc 

aatttgacgcatttacccgcgaacaattgcgcgaaggctgtcaagtgtgttccacgaactgcgacaacaagcacacacacaaacac 

aaatgttatcgtttcggcatgtttctcggtacaaagcgtgtggcgctatgtggcatgccgattcccagacagagtgatcgatagtaaa 

tgtagcctatccggtagcattcaatttccttttctatcctcgcaaacaaagcccattctggggaggcgtggtgaagctttcaaaggcat 

tgtgaaacaaatgtcctggttcggagggatgctggggaaagcaaacacggtgccgccatcgctgctaccgtcaatcgatcatgcatg 

atgtgattaatatttgtgttattcacctgcgtatctatgcgtccgtcgtgtcgttcggatttccggaagtcaaggaaaaagcgactcca 

tttgggattggtttttgcagcgaaaaatcaaaacattcgcacaaaacc^ 

vTGGTGACCTCCACCATCAAGCTGACGCTGCTCGCCTACCAGGCAACGAAA 




Igtacgtgcgctcggcgtg 
ttgccgtgggaaagcattctccctgccccatatcgctt^ 
ccgctgccatctcggcttctgaatgttttcacttctcccatacttctcccj 

CTATTCGTGCCACTGGTAi 
GCAGTGCCAGAAG 





gtaagtgtagcctggtggctggcacagaacaggctggcaaaacagggactttggctctagc 



ctgatgggtggtatatgtgtgtctattttttgctaccattctcgcatcccttcctttccag 



gttgtttgtttgtgc^ 

acagaMttgcaaaatgatt 

t£gtaM.a£t^ 

gtgtgatttgcsctcatc 

tcgtttc^cgtttta^ 

aatagag^atcgtttt^ 

ttctgtgagaattatt^ 

ctteaaacate^ 

actgcagaaaa^ 

ttcgcatcgagatgga^^ 

tacgcacaa<^ 

pagcaaaaaaatacatat 

agMaaaaaaaacactt 
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gtacggcaccgcatec^ 
ttggttctgtg^ 

gcggcstgctegtggggccMaagaat 



5 

Figure 11a continued 
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Figure lib 

Anopheles gambiae odorant receptor 7 amino acid sequence (SEQ ID NO: 20) 



MVLIQFFAILGNLATNADDVNELTANTITTLFFTHSVTKFIYFAVNSENFYRTLAIWNQT 
NTHPLFAESDARYHSIALAKMRKLLVLVMATTVLSVVAWVTITFFGESVKTVLDKATN 
ETYTVDIPRLPIKSWYPWNAMSGPAYIFSFIYQVRWRNGIMRSLMELSASLDTYRPNSSQ 
LFRAISAGSKSELIINEEKDPDVKDFDLSGIYSSKADWGAQFRAPSTLQTFDENGRNGNP 
10 NGLTRKQEMMVRSAIKYWVERHKHVVRLVSAIGDTYGPALLLHMLTSTIKLTLLAYQA 
TKIDGVNVYGLTVIGYLCYALAQVFLFCIFGNRLIEESSSVMKAAYSCHWYDGSEEAKT 
FVQIVCQQCQKAMTISGAKFFTVSLDLFASVLGAVVTYFMVLVQLK 
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